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Abstract 

Background: The long noncoding RNA MALATI (nnetastasis-associated lung adenocarcinonna transcript 1) is 
described as a potential bionnarker for NSCLC (non-snnall cell lung cancer). Diagnostic bionnarkers need to be 
detectable in easily accessible body fluids, should be characterized by high specificity, sufficient sensitivity, and 
robustness against influencing factors. The aim of this study was to evaluate the performance of MALATI as a blood 
based biomarker for NSCLC. 

Results: MALATI was shown to be detectable in the cellular fraction of peripheral human blood, showing different 
expression levels between cancer patients and cancer-free controls. For the discrimination of NSCLC patients from 
cancer-free controls a sensitivity of 56% was calculated conditional on a high specificity of 96%. No impact of tumor 
stage, age, gender, and smoking status on MALATI levels could be observed, but results based on small numbers. 

Conclusions: The results of this study indicate that MALATI complies with key characteristics of diagnostic 
biomarkers, i.e., minimal invasiveness, high specificity, and robustness. Due to its relatively low sensitivity MALATI 
might not be feasible as a single biomarker for the diagnosis of NSCLC in the cellular fraction of blood. 
Alternatively, MALATI might be applicable as a complementary biomarker within a panel in order to improve the 
entire diagnostic performance. 
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Background 

Lung cancer is the leading cause of cancer death world- 
wide [1] with NSCLC (non-small cell lung cancer) as the 
most prominent subgroup accounting for approximately 
80% of all lung cancer cases. Commonly, the disease is 
detected in late stages resulting in short survival rates, 
whereas for patients with early-stage lung cancer longer 
survival rates could be observed [2]. Thus, the detection 
of lung cancer in early stages when clinical symptoms 
have not yet occurred appears to be a promising oppor- 
tunity to decrease mortality, because in more cases a 
curative therapy might become possible. 
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In principal, biomarkers should be feasible for the de- 
tection of cancer in early stages. Thus, a major aim in 
cancer research is the identification of proper bio- 
markers. Key characteristics of diagnostic biomarkers 
among others are: (i) minimally-invasive to measure the 
biomarker in easily accessible body fluids, (ii) high speci- 
ficity to avoid false-positive results in cancer-free indi- 
viduals, (iii) sufficient sensitivity to detect the tumors, 
and (iv) robustness against potential influencing factors. 

In recent years biomarker research focused on noncod- 
ing RNAs (ncRNAs), in particular microRNAs (miRNAs). 
MiRNAs are small RNA molecules with a length of ~ 22 
nucleotides (nt), playing a central role in the regulation of 
gene expression [3] and acting as tumor suppressors or 
oncogenes in cancer [4] . Several studies show the feasibil- 
ity of using miRNAs as biomarkers in body fluids for the 
diagnosis of lung cancer [5-8]. However, there is a lack of 
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consistent results between studies focused on the identifi- 
cation of miRNAs as biomarkers [9]. Thus, the discovery 
of alternative or complementing biomarkers is essential 

In addition to miRNAs, long noncoding RNAs 
(IncRNAs) are a promising alternative within the group 
of ncRNAs. LncRNAs are commonly described as RNA 
molecules with a length > 200 nt, playing regulatory and 
structural roles in biological processes. As IncRNAs are 
implicated as tumor suppressors and oncogenes [10], they 
might be feasible as diagnostic biomarkers [11]. Currently, 
only few IncRNAs have been described as candidate bio- 
markers in human body fluids [10]. HULC (highly up- 
regulated in liver cancer) is highly expressed in hepatocel- 
lular carcinoma patients and detectable in human blood 
[12]. PCA3 (prostate cancer gene 3) is detectable in urine 
of prostate cancer patients, showing high accuracy [13]. In 
addition, MALATl (metastasis-associated lung adenocar- 
cinoma transcript 1) might be a candidate biomarker for 
NSCLC [14]. MALATl is a well-described IncRNA widely 
expressed in normal tissues [15]. In several human carcin- 
omas MALATl was shown to be upregulated [16], par- 
ticularly in early-stage metastasizing NSCLC. 

The aim of this study was the evaluation of MALATl 
as a blood-based biomarker for NSCLC. The expression 
of MALATl was measured in the cellular fraction of per- 
ipheral human blood and the expression levels of 
NSCLC patients and cancer-free controls of the general 
population were compared. 

Methods 

Study population 

The study was designed according to rules guarding pa- 
tient privacy and with the approval from the ethics com- 
mittee of the Ruhr-Universitat Bochum (No. 3217-08). 
All participants provided written informed consent. 

The cancer group of 45 NSCLC patients consisted of 
21 patients with AdCa (adenocarcinoma) and 24 patients 
with SqCC (squamous cell carcinoma). Participants were 
recruited at the HELIOS Clinic Emil von Behring, Berlin, 
Germany. Tumor staging was performed according to 
the TNM classification of malignant tumors [17]. Cancer 
patients had not been treated by surgery, chemotherapy, 
or radiation therapy before blood collection. The control 
group of 25 cancer-free subjects was drawn from the 
Heinz Nixdorf Recall study, a population-based cohort 
of elderly subjects [18]. Characteristics of the study 
groups are summarized in Table 1. Detailed subject 
characteristics are listed in Additional file 1. 

RNA isolation 

Peripheral blood samples were collected from each partici- 
pant in 9.0 ml S-Monovette EDTA gel tubes (Sarstedt, 
Niimbrecht, Germany) and centrifuged (2000 x g for 
10 minutes) within 30 minutes after collection. The 



Table 1 Characteristics of the study groups comprising 
patients with NSCLC (non-small cell lung cancer), 
subdivided into AdCa (adenocarcinoma) and SqCC 
(squamous cell carcinoma), and cancer-free controls 







1 oiai 




AuLa 




Controls 


N 




70 


45 


21 


24 


25 


Gender 


Male 


48 


30 


15 


15 


18 




Female 


22 


15 


6 


9 


7 


Age (years) 


Median 


69 


68 


68 


67 


71 




Range 


54-84 


54-83 


57-83 


54-81 


56-84 


Smoking status 


Ever 


64 


44 


20 


24 


20 




Never 


6 


1 


1 


0 


5 


Tumor stage 


l/ll 




3 


0 


3 






lll/IV 




42 


21 


21 





cellular fraction was separated from plasma and stored 
frozen until RNA isolation. 

Samples were thawed at room temperature and RNA 
isolation including DNase I treatment was performed 
from 0.5 ml of the cellular fraction using the RiboPure 
Blood Kit according to the manufacturers' instructions 
(Life Technologies, Darmstadt, Germany). 

Quantitative real-time PGR (qRT-PCR) 

TaqMan assays (Life Technologies) were used for quanti- 
tative expression analyses of MALATl (Hs00273907_sl) 
as potential biomarker and of GAPDH (Hs99999905_ml), 
HPRTl (Hs02800696_ml), and RPLPO (Hs99999902_ml) 
as potential reference genes for normalization. Quantita- 
tive real-time PGR (qRT-PGR) was performed using a 
7900 HT Fast Real-Time PGR System (Life Technologies). 
For the reverse transcription reaction 12 \A RNA and 
for the PGR reaction 5 (il cDNA were used as templates. 
Samples were analyzed in duplicate and non-template 
controls were included. For cycle threshold (Gt) estima- 
tion a fixed threshold of 0.2 was used. Gt values > 35 
were considered to be under the detection limit [19] 
and marked as 35 for analysis [20]. Raw Gt values are 
presented in Additional file 1. 

The performance of potential references was analyzed 
utilizing RefFinder [21], a web-based comprehensive 
tool (www.leonxie.com/referencegene.php), including 
the four commonly used algorithms geNorm [22], 
NormFinder [23], BestKeeper [24], and comparative 
AGt method [25], to evaluate the most stable reference 
across study groups. As the geometric mean (GM) of 
several reference genes is more reasonable than a single 
reference gene [22], the GM of potential references was 
calculated. Normalized MALATl levels were expressed 

as AGt, with AGt = Ct(^MALATl) ' Ct(Reference)- 
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Statistical analysis 

Median and inter- quartile range (IQR) were used to de- 
scribe the distribution of MALATl levels. Groups were 
compared using the non-parametric Kruskal-Wallis test 
for continuous variables. Sensitivity and specificity of 
MALATl were determined from receiver operating char- 
acteristic (ROC) curves illustrating the performance of 
MALATl to discriminate the studied groups. In brief, 
NSCLC vs. controls, AdCa vs. controls, SqCC vs. controls, 
and AdCa vs. SqCC were analyzed. The bootstrap proced- 
ure (1000 runs) was used for internal validation of the esti- 
mates in the ROC analyses. 

Potential factors influencing MALATl levels were evalu- 
ated using a linear regression model. Estimates were given as 
P with 95% confidence intervals (CI) and p values. Here, 
values of |3 > 0 indicate a negative association between the in- 
fluencing factor and MALATl levels, values of p < 0 a posi- 
tive association. Logistic regression modeling was performed 
to estimate the odds ratio (OR) with 95% CI for detecting 
NSCLC as a fijinction of normalized MALATl levels. 

Statistical analyses were performed using SAS/STAT 
and SAS/IML software, version 9.3 (SAS Institute Inc., 
Cary, NC). 

Results 

Expression stability of candidate references 

The potential reference genes GAPDH, HPRTl, and 
RPLPO were measured in all samples from NSCLC pa- 
tients and cancer-free controls. Using raw Ct values no 
significant differences between NSCLC patients and con- 
trols could be observed for GAPDH and HPRTl in con- 
trast to RPLPO (p = 0.0002), (Figure 1). Thus, RPLPO was 
excluded from further evaluation as reference gene. 

In order to identify the most stable reference across the 
study groups RefFinder was used to rank the analyzed 
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Figure 1 Scatter dot plots of raw Ct values of candidate 
reference genes. GAPDH, HPRTl, and RPLPO were measured in patients 
witli NSCLC (non-small cell lung cancer; N = 45) and cancer-free 
controls (N = 25). Horizontal bars represent median and IQR. Groups 
were compared using the Kruskal-Wallis test. 



references. Lowest rank represents the most stable refer- 
ence and highest rank represents the least stable reference 
(Table 2). The GM of GAPDH and HPRTl was identified 
as the most stable reference and used for normalization of 
MALATL 

Distribution of MALAT1 in the study groups 

Table 3 depicts the distribution of normalized MALATl 
levels in the study groups. The median normalized 
MALATl value for NSCLC was -0.35 (IQR -1.34; 1.00), 
for AdCa -0.59 (IQR -1.34; -0.25), and for SqCC -0.23 
(IQR -1.31; 1.47), whereas for controls the median nor- 
malized MALATl level was -2.07 (IQR -2.53; -0.83). 

Differences of normalized MALATl levels between 
cancer patients and cancer-free subjects were significant 
for NSCLC vs. controls (p < 0.0001), AdCa vs. controls 
(p = 0.0043), and SqCC vs. controls (p = 0.0001), whereas 
the difference between AdCa and SqCC was not signifi- 
cant (Figure 2). 

MALATl as biomarker of NSCLC 

Using ROC analyses, for NSCLC patients and controls an 
area under the curve (AUC) of 0.79 (95% CI 0.68 - 0.89), 
(Figure 3A), for AdCa patients and controls an AUC of 
0.75 (95% CI 0.59 - 0.90), (Figure 3B), for SqCC patients 
and controls an AUC of 0.82 (0.70 - 0.94), (Figure 3C), and 
for AdCa vs. SqCC an AUC of 0.58 (95% CI 0.42 - 0.76), 
(Figure 3D), were calculated. 

Sensitivity and specificity of normalized MALATl are 
shown in Table 4, calculated due false-positive rates (FPR) 
of 0% (no false-positive test), 4% (one false-positive test), 
and 8% (two false-positive tests), and to the maximum 
Youdens Index (YI = sensitivity + specificity -1), respect- 
ively. A FPR of 4%, representing 96% specificity, resulted 
in 56% sensitivity for the discrimination of NSCLC from 
controls. The sensitivity to discriminate SqCC from con- 
trols is higher (63%) than the sensitivity to discriminate 
AdCa from controls (48%). A FPR of 8% (92% specificity) 



Table 2 Results of reference analysis using RefFinder [21] 
to evaluate the most stable reference across the study 
groups 



Algorithm 



Reference ranking 





1 (most 
stable) 


2 


3 (least 
stable) 


RefFinder (comprehensive 
ranl<ing) 




GAPDH 


HPRTl 


geNorm 


GM* & GAPDH 




HPRTl 


NormFinder 


GM* 


GAPDH 


HPRTl 


BestKeeper 


HPRTl 


GM* 


GAPDH 


Comparative ACt metliod 


GM* 


GAPDH 


HPRTl 


^Geometric mean (GM) was calculated from GAPDH and HPRTl. 
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Table 3 Distribution of normalized MALAT1 levels by median and inter-quartile range (IQR) stratified by potential 
influencing factors 

Total Controls NSCLC AdCa SqCC 







N 


Median 


IQR 


N 


Median 


IQR 


N 


Median 


IQR 


N 


Median 


IQR 


N 


Median 


IQR 


Total 




70 


-0.86 


-2.10; 
-0.05 


25 


-2.07 


-2.53; 
-0.83 


45 


-0.35 


-1.34; 
1.00 


21 


-0.59 


-1.34; 
0.25 


24 


-0.23 


-1.31; 
1.47 


Gender 


Male 


48 


-0.80 


-2.18; 
-0.14 


18 


-2.18 


-2.54; 
-0.64 


30 


-0.35 


-1.30; 
0.97 


15 


-0.35 


-1.44; 
0.83 


15 


-0.36 


-1.25; 
1.32 




Female 


22 


-1.01 


-2.10; 
-0.26 


7 


-1.49 


-2.24; 
-0.92 


15 


-0.35 


-1.37; 
1.54 


6 


-0.84 


-1.34; 
-0.35 


9 


0.26 


-1.37; 
1.97 


Age 


< 

60 years 


11 


-1.93 


-2.69; 
0.25 


4 


-2.77 


-3.58; 
-1.49 


7 


-1.10 


-2.01; 
0.26 


2 


-1.14 


-2.54; 
0.25 


5 


-1.10 


-1.93; 
0.26 




60- 

69 years 


26 


-0.78 


-1.69; 
-0.05 


7 


-1.69 


-2.24; 
-0.83 


19 


-0.41 


-1.35; 
1.00 


9 


-0.89 


-1.34; 
-0.05 


10 


-0.19 


-0.74; 
1.47 




70- 

79 years 


27 


-0.78 


-2.29; 
-0.83 


11 


-2.07 


-2.53; 
-0.61 


16 


-0.29 


-1.37; 
2.27 


8 


-0.57 


-1.37; 
1.72 


8 


0.54 


-1.39; 
2.64 




> 

80 years 


6 


-0.78 


-1.25; 
-0.35 


3 


-0.92 


-2.64; 
-0.64 


3 


-0.35 


-1.25; 
0.24 


2 


-0.06 


-0.35; 
0.24 


1 


-1.25 


-1.25; 
-1.25 


Smoking 
status 


Ever 


64 


-0.86 


-2.12; 
0.10 


20 


-2.15 


-2.53; 
-0.87 


44 


-0.36 


-1.35; 
0.99 


20 


-0.69 


-1.39; 
0.25 


24 


-0.23 


-1.31; 
1.47 




Never 


6 


-1.01 


-2.10; 
-0.61 


5 


-1.38 


-2.10; 
-0.64 


1 


1.54 




1 


1.54 










Tumor size 


l/ll 














3 


-1.10 


-2.17; 
-0.41 








3 


-1.10 


-2.17; 
-0.41 




lll/IV 














42 


-0.29 


-1.34; 
1.32 


21 


-0.59 


-1.34; 
0.25 


21 


-0.15 


-1.25; 
1.47 


Metastasis 
status 


MO 














7 


-0.41 


-1.93; 
1.47 


2 


-0.22 


-1.44; 
1.00 


15 


-0.41 


-2.01; 
1.91 




Ml 














19 


-0.29 


-1.17; 
0.54 


9 


-0.59 


-1.34; 
0.25 


9 


-0.15 


-0.24; 
1.32 


Lympli node 
status 


NO 














3 


0.24 


-2.17; 
1.32 


1 


0.24 




2 


-0.42 


-2.17; 
1.32 




N1-N3 














42 


-0.36 


-1.34; 
1.00 


20 


-0.69 


1 .39; 
0.54 


22 


-0.23 


-1.25; 
1.47 



did not lead to any increase in sensitivities, whereas a FPR 
of 0% (100% specificity) resulted in lowest sensitivities for 
the discrimination of controls from patients with NSCLC 
(47%), AdCa (38%), or SqCC (54%). Use of the maximum 
YI leads to an increase in sensitivity to 81% only for AdCa 
vs. controls, but specificity decreased to 64%. For the dis- 
crimination of AdCa from SqCC a FPR of 5% (95% specifi- 
city, one false-positive test) resulted in 8% sensitivity and a 
FPR of 10% (90% specificity, two false-positive tests) re- 
sulted in 21% sensitivity, whereas using the maximum YI 
resulted in 33% sensitivity and 86% specificity. 

ROC analyses on 1000 bootstrap samples resulted in 
similar cutoffs, sensitivities, and specificities of MALATl 
in comparison to the original analyses. The calculated 
95% CI regarding NSCLC vs. controls and SqCC vs. con- 
trols indicate a good precision of this assessment, 
whereas AdCa vs. controls the 95% CI shows a less pre- 
cision (Additional file 2). 

The application of logistic regression models revealed a 
two-fold increased risk of detecting NSCLC per normalized 



10 



-10 




NSCLC 



p < 0.0001 



p = 0.0043 



p = 0.0001 



••••• 



AdCa 



SqCC Controls 

Figure 2 Scatter dot plots of MALATl levels in NSCLC (N =45), 
AdCa (N = 21), SqCC (N = 24), and cancer-free controls (N = 25). 

Expression values were normalized by the geometric mean of 
GAPDH and HPRTl and expressed as ACt. Horizontal bars represent 
median and IQR. Groups were compared using the Kruskal-Wallis 
test. NSCLC: non-small cell lung cancer, AdCa: adenocarcinoma, 
SqCC: squamous cell carcinoma. 
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1 -Specificity 1 -Specificity 



Figure 3 Receiver operating characteristics (ROC) curves of normalized MALATh The area under curve (AUC) was determined for MALATl of 
(A) NSCLC (non-small cell lung cancer) patients and controls, (B) AdCa (adenocarcinoma) patients and controls, (C) SqCC (squamous cell 
carcinoma) patients and controls, and (D) AdCa patients and SqCC patients. 



MALATl unit. The OR of MALATl was 1.88 (95% CI 
1.26 - 2.83) without adjustment and 2.03 (95% CI 1.30- 
3.16) with adjustment for gender, age and smoking 
status. 

Potential factors influencing MALATl 

The influence of tumor characteristics on MALATl 
levels is shown in Table 5. MALATl is not affected by 
tumor size, metastasis status or lymph node status. 

The impact of potential influencing factors on the ex- 
pression levels of MALATl are shown in Table 6. NSCLC 
showed a significant 1.63-fold (95% CI 0.75 - 2.51) de- 
crease of MALATl (p = 0.0003), whereas the factors gen- 
der, age, and smoking status showed no impact on the 
MALATl levels in human blood. 

Discussion 

NSCLC is commonly detected in late stages of the dis- 
ease. Biomarkers have the potential to detect cancer at 
early stages, facilitating an earlier and therefore more 



curative therapy that ideally results in decreased mortality. 
In NSCLC, Gutschner et al. showed that MALATl regu- 
lates the expression of several metastasis-associated genes, 
e.g. CDCPl (CUB domain containing protein 1) and 
GPC6 (glypican 6), indicating a major role of MALATl in 
disease progression [26]. Additionally, it was suggested 
that MALATl might also regulate other important cellular 
processes in lung cancer [26]. Thus, MALATl is a candi- 
date biomarker for NSCLC [14]. 

For quantitative expression analysis of messengerR- 
NAs (mRNAs) and miRNAs qRT-PCR is considered to 
be the gold standard [27] and the same might be true 
for IncRNAs. However, to produce reliable data in qRT- 
PCR assays the use of appropriate reference genes for 
normalization is an important issue [28] and candidate 
reference genes need to be tested prior to application 
[29]. As no information regarding IncRNAs as references 
were accessible, mRNAs were selected as potential refer- 
ences. HPRTl and RPLPO are well-described reference 
genes for analyses in NSCLC tissues [30] and GAPDH 
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Table 4 Sensitivity and specificity of normalized MALAT1 and number of true-positive, true-negative, false-positive, 
and false-negative tests, calculated for different false positive rates (FPR), i.e., none, one, and two false-positive tests 
and maximum Youden's Index (Yl) 



Cutoff Sensitivity 

(%) 



Specificity 

(%) 



True-positive 
(N) 



True-negative 
(N) 



False-positive 
(N) 



False-negative 
(N) 



NSCLC vs. 
controls 


FPR = 0% 


-0.24 


47 


100 


21 


25 


0 


24 




FPR = 4% 


-0.41 


56 


96 


25 


24 


1 


20 




FPR = 8% 


-0.42 


56 


92 


25 


23 


2 


20 




Maximum 
Yl 


-0.41 


56 


96 


25 


24 


1 


20 


AdCa vs. controls 


FPR = 0% 


-0.05 


38 


ICQ 


8 


25 


0 


13 




FPR = 4% 


-0.35 


48 


96 


10 


24 


1 


11 




FPR = 8% 


-0.42 


48 


92 


10 


23 


2 


11 




Maximum 
Yl 


-1.44 


81 


64 


17 


16 


9 


4 


SqCC vs. controls 


FPR = 0% 


-0.24 


54 


ICQ 


13 


25 


0 


11 




FPR = 4% 


-0.41 


63 


96 


15 


24 


1 


9 




FPR = 8% 


-0.42 


63 


92 


15 


23 


2 


9 




Maximum 
Yl 


-0.41 


63 


96 


15 


24 


1 


9 


AdCa vs. SqCC 


FPR = 0% 


3.37 


8 


100 


2 


21 


0 


22 




FPR = 5% 


2.76 


8 


95 


2 


20 


1 


22 




FPR = 10% 


1.91 


21 


90 


5 


19 


2 


19 




Maximum 
Yl 


1.32 


33 


86 


8 


18 


3 


16 



was already applied for normalization of MALATl [15]. 
However, in this study RPLPO seems to be no feasible 
reference which is in agreement with Falkenberg et al, 
showing that RPLPO is not appropriate as reference gene 
in human blood samples [31]. In this study, GAPDH and 
HPRTl were suitable reference genes, particularly the 
GM of GAPDH and HPRTl showed the best reference 
performance. This is in accordance with Ulivi et al, 
using GAPDH and HPRTl for normalization of mRNAs 
in blood samples of NSCLC patients and controls [32]. 

One key characteristic of proper diagnostic biomarkers 
is the need to be detectable in easily accessible body fluids 
like peripheral blood. In this study MALATl was mea- 
sured in the cellular fraction of human blood, showing 



that this matrix is in principle appropriate for the analysis 
of IncRNAs. Comparable results for the usability of the 
cellular blood fraction were shown for miRNAs [33,34]. 
Commonly, the cellular fraction obtained during plasma 
preparation is discarded, but it might be reasonable to col- 
lect this matrix in biobanks for subsequent biomarker 
discovery. 

In this study, a significant downregulation of MALATl 
in NSCLC patients in comparison to cancer-free con- 
trols was shown. Comparable results were achieved by 
Zhang et al., showing a downregulation of MALATl in 
patients with hepatocellular carcinomas [35]. However, 
MALATl was implicated to play an oncogenic role [10] 
and upregulation of MALATl was observed in several 



Table 5 Analysis of tumor characteristics influencing MALATl levels in human blood 







NSCLC 






AdCa 






SqCC 




P 


95% CI 


p value 


P 


95% CI 


p value 


P 


95% CI 


p value 


Intercept 


-1.23 


-3.49; 1 .03 










-1.23 


-3.41; 0.96 




Tumor size (Reference: l/ll) lll/IV 


1.06 


-1.28; 3.39 


0.3670 








1.52 


-0.82; 3.85 


0.1911 


Intercept 


-0.15 


-1.11; 0.80 




-0.22 


-3.24; 2.81 




-0.14 


1.15; 0.86 




Metastasis status (Reference: MO) Ml 


-0.14 


-1.36; 1.07 


0.8143 


-0.46 


-3.64; 2.72 


0.7655 


0.65 


-0.98; 2.26 


0.4158 


Intercept 


-0.20 


-2.48; 2.08 




0.24 


-4.02; 4.51 




-0.42 


-3.20; 2.35 




Lymph node status (Reference: NO) N1-N3 


-0.04 


-2.40; 2.32 


0.9714 


-0.92 


-5.29; 3.45 


0.6654 


0.57 


-2.32; 3.47 


0.6860 


Values of p > 0 indicate a negative association between the analyzed factor and MALATl levels, values < 0 a positive association. 
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Table 6 Analysis of potential factors influencing MALAT1 
levels in human blood 







P 


95% CI 


P 

value 


Intercept 




-4.64 


-8.93; 
-0.35 




Group (Reference: controls) 


NSCLC 


1.63 


0.75; 
2.51 


0.0004 


Gender (Reference: male) 


Female 


0.22 


-0.66; 
1.10 


0.6210 


Age 


[10 years] 


0.45 


-0.10; 
1.00 


0.1058 


Smoking status (Reference: 
never) 


Ever 


-0.42 


-1.94; 
1.10 


0.5848 



Statistically significant changes are marked in bold. 

Values of p > 0 indicate a negative association between analyzed factor and 
MALAT1, values < 0 a positive association. 



other cancers, e.g. of the breast and prostate [16]. Such dif- 
ferences might be caused by the paradigm that in fact 
MALATl is expressed ubiquitously but fulfills tissue- 
specific functions depending on the cellular environment 
[36]. Commonly, MALATl is analyzed in tissues [14-16], 
whereas in this study and the study of Zhang et al. [35] 
MALATl was detected in blood. Because MALATl was de- 
tected in the cellular fraction of blood, it is unlikely to be 
directly produced by the tumor tissue. Its downregulation 
in blood cells may be an indirect effect of the tumor, e.g., 
on the immune system. The source of MALATl in the cel- 
lular fraction of blood remains unclear. Theoretically, it 
might originate from leucocytes altered by the tumor. How- 
ever, further analyses are needed to evaluate the origin of 
MALATl in human blood. Very recently, it was shown that 
MALATl was detectable in plasma of patients with gastric 
or prostate cancer [37,38]. Thus, it would be reasonable to 
analyze MALATl in plasma of NSCLC patients instead of 
the cellular fraction, because the presence of MALATl in 
plasma might be a direct effect of the tumor, e.g., release of 
IncRNA-containing extracellular vesicles [39]. 

Tani et al. showed that the stability of MALATl varied 
in various cell types and indicated that the half-life of 
MALATl is shorter than the median half-life of mRNA 
[40]. Such decay of MALATl might also prevail in blood 
cells. It is well known that systems like PAXgene, Tempus, 
and RNAlater stabilize mRNAs and miRNAs in whole 
blood samples [41-43]. Thus, the performance of the assay 
was additionally tested in a few available blood samples 
stabilized by PAXgene or RNAlater. In the stabilized blood 
samples MALATl is detectable at lower Ct values corre- 
sponding to larger quantities (data not shown). The results 
implicate that the use of stabilization systems might be 
meaningful for IncRNA analyses in blood. However, this 
assumption needs to be verified in more detail. 

Regarding the key characteristics of an obligatory high 
specificity and a sufficiently high sensitivity of diagnostic 



biomarkers, the candidate biomarker MALATl does not 
fulfill both criteria. Generally, in screening cohorts a high 
specificity is needed to avoid an unacceptably high num- 
ber of false-positive tests that would result in psycho- 
logical pressure and needless intervention for the patients. 
Thus, the sensitivity of candidate biomarkers should be 
calculated at a fixed high specificity level [44]. In regard to 
the relatively small study group the specificity of 96% is 
quite high, particularly as this corresponds to only one sin- 
gle false-positively tested control. On the other hand, the 
calculated sensitivity is too low (56%) for the use of 
MALATl as a single biomarker for the diagnosis of 
NSCLC, particularly for the subtype AdCa (48%). How- 
ever, lower sensitivity could be balanced by the use of sev- 
eral biomarkers in a panel. Theoretically, in an optimal 
panel every single biomarker is characterized by suffi- 
ciently high sensitivity and the necessary high specificity, 
perfectly complement each other in order to obtain super- 
ior diagnostic performance [45] . Thus, it might be reason- 
able to verify MALATl in combination with other 
biomarkers in larger study groups to improve the entire 
diagnostic performance of the biomarker panel. However, 
for the discrimination of AdCa and SqCC, a sensitivity of 
only 8% precludes MALATl as a biomarker for the differ- 
ential diagnosis of NSCLC subtypes. 

Bootstrap analysis showed that the calculated cutoffs, 
sensitivities, and specificities remain stable, indicating 
that the calculated values are appropriate for the dis- 
crimination of patients and controls. 

Regarding the fourth key characteristic of diagnostic 
biomarkers, the results indicate that MALATl values in 
blood are not correlated with tumor size, metastasis sta- 
tus, or lymph node status. However, more cases of early- 
stage metastasizing NSCLC need to be analyzed in sub- 
sequent studies because this study comprises only three 
cases with tumor stage Tl or T2. Additionally, MALATl 
seems to be relatively independent from common influ- 
encing factors like age, gender, and smoking status, indi- 
cating the robustness of the candidate biomarker. These 
observations are in agreement with MALATl expression 
in tissue [46]. However, it has to be clarified if other po- 
tential influencing factors from the multitude of bio- 
logical, preanalytical, and analytical factors show an 
impact on MALATl levels in human blood. 

Conclusions 

MALATl could be detected in peripheral blood, showing 
different expression levels between NSCLC patients and 
cancer-free controls. It was shown that MALATl com- 
plies with key characteristics of diagnostic biomarkers, 
being minimally-invasive, exhibiting high specificity, and 
robustness. On the contrary, the observed sensitivity is 
too low for the use of MALATl as a single biomarker 
for the diagnosis of NSCLC using the cellular fraction of 



Weber et al. BMC Research Notes 2013, 6:518 
http://www.bionnedcentral.conn/1 756-0500/6/51 8 



Page 8 of 9 



peripheral blood. However, it might be reasonable to 
verify the performance of MALATl as a complementary 
biomarker within a panel in larger studies including 
more cases of early-stage metastasizing NSCLC. 
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